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Abstract 

Background: Controlling global epidemics in the real world and accelerating information propagation in 
the artificial world are of great significance, which have activated an upsurge in the studies on networked 
spreading dynamics. Lots of efforts have been made to understand the impacts of macroscopic statistics 
(e.g., degree distribution and average distance) and mcsoscopic structures (e.g., communities and rich 
clubs) on spreading processes while the microscopic elements are less concerned. In particular, roles of 
ties are not yet clear to the academic community. 

Methodology/Principle Findings: Every edges is stamped by its strength that is defined solely based 
on the local topology. According to a weighted susceptiblc-infected-susccptiblc model, the steady-state 
infected density and spreading speed are respectively optimized by adjusting the relationship between 
edge's strength and spreading ability. Experiments on six real networks show that the infected density is 
increased when strong ties are favored in the spreading, while the speed is enhanced when weak ties are 
favored. Significance of these findings is further demonstrated by comparing with a null model. 

Conclusions/Significance: Experimental results indicate that strong and weak ties play distinguishable 
roles in spreading dynamics: the former enlarge the infected density while the latter fasten the process. 
The proposed method provides a quantitative way to reveal the qualitatively different roles of ties, which 
could find applications in analyzing many networked dynamical processes with multiple performance 
indices, such as synchronizability and converging time in synchronization and throughput and delivering 
time in transportation. 

Introduction 

Early before the classification of social ties proposed, in 1954, the Russian mathematical psychologist 
Anatol Rapoport [1] has been aware of "well-known fact that the likely contacts of two individuals who 
are closely acquainted tend to be more overlapping than those of two arbitrarily selected individuals." 
This argument became one of the cornerstones of social network theory. In 1973, ties in social networks, 
generally, come in two varieties: strong and weak, which has been first proposed by the American 
sociologist Mark Granovettcr in Rcf. [2], one of the most influential sociology papers ever published. 
Different relationships can be measured in the currency of tie strength. According to the closeness, 
connections to close friends have been said to be "strong" tics, while those to acquaintances have been 
called "weak" ties [SHS]. Tie strength usually plays an vital role in many real networks and is crucial 
to understand dynamical processes on the networks [BHS]. Strong tics are the people you really trust, 
people whose social circles tightly overlap with your own. Often, they are also the people most like you. 
Weak ties, conversely, are merely acquaintances and often provide access to novel information [B]. Weak 
ties display an important bridging function [J , while strong ties are more likely to activated for the flow 
of referral information and more influential than weak ties [8]. In addition, weak ties could play a more 
signiflcant role than strong ties to keep the stability j9l, maintain the connectivity jlOj and uncover the 
missing information [TTj . 

In despite of the qualitative distinction between strong and weak ties, the tie strength could be 
quantitatively described by edge weight — the edges with high weights are considered to be strong. In a 
number of social networks, edges are often associated with weights that differentiate them in terms of their 
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strength, intensity, capacity or the frequency of recent contacts [3J[T2]. For non-social networks, weights 
often refer to the functions performed by edges, e.g., the amount of traffic flowing along connections in 
world-wide airport networks [I2j, the number of joint papers of two coauthors in scientist collaboration 
networks |12| . the number of synapses and gap junctions in neural networks |13] . the carbon flow between 
species in food webs [M]. Yan et al. [15] investigated the epidemic spreading in weighted scale- free 
networks and the simulation results indicated that the more homogeneous weight distribution of the 
network, the more quickly epidemic spreads on it. This finding was further demonstrated by an edge- 
based mean-field solution jl6j . Chu et al. [17| showed that the weight distribution has strong impacts 
on both epidemic threshold and prevalence. Baronchelli and Pastor-Satorras [H] considered the diffusive 
dynamics on weighted networks and shed light on the validity of mean-field theory on weighted networks. 
These studies revealed quantitative impacts of edges with different weights, but ignored the qualitatively 
different roles respectively played by strong and weak ties. 

In this paper, we propose a quantitative method to uncover the qualitatively different roles of strong 
and weak ties via a weighted susceptible-infected-susccptible (SIS) model. Experimental results on six 
real networks emphasize the different roles of strong and weak ties: the former improve the steady-state 
fraction of infected nodes while the latter fasten the spreading speed. 



Results 
Model 

The SIS model |19| is suitable to describe the cases when the individuals cannot acquire immunity after 
recovering from the disease, such as influenza, pulmonary tuberculosis and gonorrhea. With disease of this 
kind individuals that are cured usually catch again. In the networked SIS model, nodes are in two discrete 
states, "susceptible" or "infected" . Each infected node will contact all its neighbors once at each time 
step, and therefore the infectivity of each node is proportional to its degree. In the real world, individuals 
may be only able to contact limited population within one time step [20j . For example, salesman in 
network marketing processes will not make referrals to all his acquaintances due to the limited money 
and time }21j . In sexual contact networks, although a few individuals have hundreds of sexual partners, 
their sexual activities are not far beyond a normal level due to the physiological limitations [22 ( 123 ) . 
Therefore, in the present model we assume every individual has the same infectivity [2011^ . Without the 
lose of generality, at each time step, each infected node will select one of its neighbors to contact. If the 
selected neighbor has been infected already, nothing happens, while if it is susceptible, with probability 
a, it will be infected. Meanwhile, each infected node will become susceptible in the next time step with 
probability /3. The probability an infected node i will select its neighbor j is 

where is the neighbor set of node i, Sij denotes the tie strength between i and j, and 6 is a tunable 
parameter. If & = 0, the infected node randomly selects a neighbor to contact. If 6 > 0, strong ties are 
favored to constitute the paths of spreading, while if 6 < 0, weak ties are favored. 

In different contexts, the strength of a tie may have different definitions and measures [25l[26], which 
may depend on external information to network topology. For general networks, one may be not aware 
of external information and thus it is meaningful to give a natural definition solely based on network 
topology. According to Rapoport's theory [T] and other supportive observations [?7H^ and models 
[301I31[ . we define the tie strength stj in spite of i and j's common neighbors, as follows: 
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where n.y is the number of common neighbors of i and j, ki is the degree of i, and (5 is a constant that 
gives chance to the tie connecting two nodes without common neighbors. For simphcity, we set 5 — 1. 
The dynamical process starts with randomly selecting a certain number of infected nodes. 

Data 

To see the impacts of tie strength on the spreading dynamics, experiments are carried out on six real 
networks: (1) Facebook-like Social Network (FSN) [32]: it originates from an online community for 
students at University of California. The dataset includes the users that have sent or received at least 
one message, and an undirected edge is set between i and j if i has sent (received) an online message to 
(from) j. (2) Enron Email Network (EEN) [33l|34]: it covers about half million emails. Nodes of this 
network are email addresses and if an address i sent at least one email to address j , an undirected edge is 
established between i and j. (3) Slashdot Social Network (SSN) [35]: nodes in this network are the users 
in Slashdot, which is a technology-related news website, and edges represent friendships between users. 
(4) Epinions Social Network (ESN) [36]: this is a who-trust-whom online social network. Nodes are the 
members of the general consumer review site Epinions. com, and edges represent the trust relationships 
between two members. (5) Gnutella Peer-to-peer Network (GPN) [37]: Nodes represent hosts and edges 
stand for connections between the hosts. (6) Oregon Autonomous Systems (OAS) [38]: this is an AS- level 
Internet topology graph obtained by the Route Views Project. In order to guarantee the connectivity, 
we use the largest connected components of these networks. Table [T] presents the basic statistics of the 
largest connected components of the six networks. 

Results and Analysis 

All simulations start with 20 randomly selected nodes as infected sources, with infectivity rate a = 0.4 and 
recovery rate /3 = 0.1. Figure [T] gives us intuition of the spreading processes following the proposed model, 
in which one can obverse the number of infected nodes, /(t), changing with time, and considerable 
effects made by b. We next investigate 6's impacts on two macroscopic features of the spreading process — 
the prevalence and speed. The first one is characterized by the fraction of infected nodes in the steady 
state, p = I{oo)/N . For the second one, if we define A/(i) := /(oo) — /(i), it converges to zero following 
an exponential form (it is easy to be checked numerically and thus not shown here), A/(t) oc e""^*, and 
the parameter A is an informative index for the converging speed. The larger p the wider the spreading, 
and the larger A the faster the spreading. 

Figure [2] displays how the steady-state infection density p and the speed index A change with b based 
on the six real networks. For each case, there are well-defined peaks for both p{b) and A(6). More 
interestingly, in every case, all the "optimal" values of b subject to high prevalence are larger than zero, 
while all the "optimal" values of 6 subject to high speed are smaller than zero. These phenomena indicate 
the different roles of strong and weak ties: the former operate locally and keep a high prevalence in local 
surrounding while the latter bridge different densely connected groups and enhance the chance to infect 
healthy groups. As an imprecise metaphor to global epidemic spreading, the strong ties are similar to 
the urban buses and gathering places while the weak ties are similar to the intercity trains and airlines. 

To emphasize the significance of our findings, we further consider the null model, where strengths are 
completely redistributed. This randomization operation destroys the correlation between structure and 
strength. Figure [3] reports p{b) for real networks and the corresponding randomized networks. Different 
from the positive peaks for real networks, all peaks for randomized networks arc at 6 = 0. Figure |4] shows 
\{b) for real networks and the corresponding randomized networks, again, peaks for randomized networks 
are all at 6 = 0. The results are in accordance with the known conclusion [TB] that in the absence of 
correlation between structure and weight, the most homogeneous weight distribution leads to the widest 
and fastest epidemic spreading. Comparison between results from real and randomized eases suggests 
that the non-zero "optimal" values of the parameter b do reveal some significant features of reality. 
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Discussion 

We studied a weighted SIS model where an edge's weight is functionally dependent on its strength. 
Motivated by the transitivity in social network analysis j39j and the Rapoport's theory [T], we apply a 
common-neighborhood-based similarity index to quantify edge strength. By comparing the experimental 
results on six real networks and their counterpart randomized networks, we show that strong and weak 
ties play distinguishable roles in spreading dynamics: the former enlarge the infected density while the 
latter fasten the process. 

Note that, the optimal values of b subject to the infected density and spreading speed are far different, 
indicating that the "optimization" of an spreading process could have multiple performance indices and 
these indices may conflict to each other. Analogous examples are numerous: bypassing main intersections 
and arterial roads could enhance the network throughput while increase the delivering time especially 
in uncongested states |40j . and removing directed loops could enhance the synchronizability while slow 
down the synchronizing process j41j . Our method provides a quantitative way to reveal the qualitatively 
different roles of ties (also can be extended to deal with nodes) in such dynamical processes. 

We hope this work could contribute to the long journey towards fully understanding the relations 
between structural features and functional performances. Along this way, more experiments on disparate 
networks will benefit us, and we will look into benchmark network models to see whether some structural 
properties, such as the presence of degree-degree correlation and community structure, are responsible 
for the observations reported here. 
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Figure Legends 

FiglT] Spreading processes for different values of the parameter b. 

Figl2] Steady-state infected density p and speed index A for different b. 
Figl3] Comparing p{b) between real and randomized networks. 
FigHl Comparing X{b) between real and randomized networks. 

Tables 

Table 1. Basic characteristics of the largest connected components, where N and E 
represent the number of nodes and edges, respectively. 



Dataset 


FSN 


FEN 


SSN 


ESN 


GPN 


OAS 


N 


1,893 


33,696 


74,444 


75,868 


8,842 


10,670 


E 


13,835 


180,811 


382,456 


405,729 


31,837 


22,002 
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Figure 1. Spreading processes for different values of the parameter b. These six plots show 
how the number of mfected individuals, I{t), changes with time on the six real networks: (a) 
Facebook-like social network, (b) Enron email network, (c) Slashdot social network, (d) Epinions social 
network, (e) Gnutells peer-to-peer network, (f) Oregon autonomous systems. Results are obtained by 
averaging over 1000 independent realizations. 




Figure 2. Steady-state infected density p and speed index A for different b. The serial 
numbers (a)-(f) have the same meaning to those for figure 1. The black and blue curves represent the 
steady-state infected density and the speed index, respectively. The black and blue dash lines indicate 
the peaks of the corresponding curves. Results arc obtained by averaging over 1000 independent 
realizations. 
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Figure 3. Comparing p{b) between real and randomized networks. The serial numbers (a)-(f) 
have the same meaning to those for figure 1. The black and red curves/lines represent the cases for real 
and randomized networks, respectively. Results are obtained by averaging over 1000 independent 
realizations. 




Figure 4. Effects of correlations between tie strength and network structure on the speed 
to reach the steady state. The serial numbers (a)-(f) have the same meaning to those for figure 1. 
The black and red curves/lines represent the cases for real and randomized networks, respectively. 
Results are obtained by averaging over 1000 independent realizations. 



